Introduce datatable.unique.names policy for duplicate handling in setnames() #4044 by venom1204 · Pull Request #7674 · Rdatatable/data.table

venom1204 · 2026-03-20T09:32:40Z

Apologies for opening a new PR—there was an issue with the previous one. I’ve incorporated the requested changes, while keeping the rest unchanged.

Kindly review it when you have the time. Also, sorry for the delay; I was occupied with other commitments.

Thank you!

codecov · 2026-03-20T09:39:54Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 99.04%. Comparing base (7db13b9) to head (471ab20).

Additional details and impacted files

@@           Coverage Diff           @@
##           master    #7674   +/-   ##
=======================================
  Coverage   99.04%   99.04%           
=======================================
  Files          87       87           
  Lines       17031    17049   +18     
=======================================
+ Hits        16868    16886   +18     
  Misses        163      163

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

github-actions · 2026-03-20T09:54:24Z

No obvious timing issues in HEAD=issue4044

Generated via commit 471ab20

Download link for the artifact containing the test results: ↓ atime-results.zip

Task	Duration
R setup and installing dependencies	2 minutes and 53 seconds
Installing different package versions	23 seconds
Running and plotting the test cases	4 minutes and 3 seconds

ben-schwen · 2026-04-20T14:11:44Z

 test(2367.6, fread(file(f)), data.table(), warning="Connection has size 0.")
 unlink(f)
+
+#4044


please dont only write the issue number, but what issue is fixed. this provides information without needing to search for the issue on github.

The tests seem ok but an example of DT = data.table(a=1:2, b=c('x', 'y')) would have been easier to grasp (and also less typing/code)

ben-schwen · 2026-04-20T14:22:55Z


+process_name_policy = function(names_vec) {
+  policy = getOption("datatable.unique.names")
+  if (is.null(policy) || policy == "off") return(names_vec)


no ultra happy with this double option for "off". I would either eliminate NULL or "off" (leaning towards eliminating NULL though)

ben-schwen · 2026-04-20T14:25:27Z

    \item{\code{datatable.enlist}}{Experimental feature. Default is \code{NULL}. If set to a function
      (e.g., \code{list}), the \code{j} expression can return a \code{list}, which will then
      be "enlisted" into columns in the result.}
+    \item{\code{datatable.unique.names}}{A character string, default \code{NULL} (same as \code{"off"}). 


we should also document the other options, e.g. warn, error etc.

ben-schwen · 2026-04-20T14:31:23Z

+    msg = paste0("Duplicate column names created: ", brackify(dups), ". This may cause ambiguity.")
+
+    switch(policy,
+      warn = warningf(msg),


see comment above from previous review. this will trip with

DT = data.table("a%d" = 1, b = 2) options(datatable.unique.names = "warn") setnames(DT, "b", "a%d") # Error in sprintf(gettext(fmt, domain = domain, trim = trim), ...) : # too few arguments

Hence, you need to pass format strings like warningf("%s", msg)

ben-schwen · 2026-04-20T14:32:00Z


 5. `tables()` can now optionally report `data.table` objects stored one level deep inside list objects when `depth=1L`, [#2606](https://github.com/Rdatatable/data.table/issues/2606). Thanks @MichaelChirico for the report and @manmita for the PR

+6. `setnames()` now supports a global option `datatable.unique.names` to control the creation of duplicate column names. Users can choose between `"off"` (default), `"warn"`, `"error"`, or `"rename"`. This addresses long-standing ambiguity issues when duplicate names were created silently, [#4044](https://github.com/Rdatatable/data.table/issues/4044). Thanks to @venom1204 for the PR.


I dont think we need this part since its superfluous "This addresses long-standing ambiguity issues when duplicate names were created silently"

ben-schwen · 2026-04-20T15:15:06Z

        table_name, brackify(duplicate_names), domain=NA)
 }

+process_name_policy = function(names_vec) {


Also since we only use it with setnames it should probably live inside data.table.R in front of setnames

apllied the changes as requested

c13642f

venom1204 requested a review from MichaelChirico as a code owner March 20, 2026 09:32

added test

488b873

lintr

471ab20

venom1204 requested a review from ben-schwen April 14, 2026 20:47

ben-schwen reviewed Apr 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce datatable.unique.names policy for duplicate handling in setnames() #4044#7674

Introduce datatable.unique.names policy for duplicate handling in setnames() #4044#7674
venom1204 wants to merge 3 commits intomasterfrom
issue4044

venom1204 commented Mar 20, 2026

Uh oh!

codecov bot commented Mar 20, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 20, 2026 •

edited

Loading

Uh oh!

ben-schwen Apr 20, 2026

Uh oh!

ben-schwen Apr 20, 2026

Uh oh!

ben-schwen Apr 20, 2026

Uh oh!

ben-schwen Apr 20, 2026

Uh oh!

ben-schwen Apr 20, 2026

Uh oh!

ben-schwen Apr 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		5. `tables()` can now optionally report `data.table` objects stored one level deep inside list objects when `depth=1L`, [#2606](https://github.com/Rdatatable/data.table/issues/2606). Thanks @MichaelChirico for the report and @manmita for the PR

		6. `setnames()` now supports a global option `datatable.unique.names` to control the creation of duplicate column names. Users can choose between `"off"` (default), `"warn"`, `"error"`, or `"rename"`. This addresses long-standing ambiguity issues when duplicate names were created silently, [#4044](https://github.com/Rdatatable/data.table/issues/4044). Thanks to @venom1204 for the PR.

Conversation

venom1204 commented Mar 20, 2026

Uh oh!

codecov bot commented Mar 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

github-actions bot commented Mar 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ben-schwen Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

ben-schwen Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

ben-schwen Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

ben-schwen Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

ben-schwen Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

ben-schwen Apr 20, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov bot commented Mar 20, 2026 •

edited

Loading

github-actions bot commented Mar 20, 2026 •

edited

Loading